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We report the draft genome sequence of Mycobacterium tuberculosis Beijing strain OM-V02_005, which exhibits possible hyper- 
transmissible characteristics among the population of patients with multidrug-resistant tuberculosis in Osaka Prefecture, the 
largest urban area in western Japan. 
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The Mycobacterium tuberculosis Beijing strain is a causative 
agent of tuberculosis with increasing prevalence, and it is as- 
sumed to have a considerable impact on resistance to major 
antituberculosis drugs. We identified the most prevalent clus- 
ter (OM-V02) among the population of multidrug-resistant 
tuberculosis (MDR-TB) patients in Osaka Prefecture, the larg- 
est urban area in western Japan ( 1 ) . To elucidate the underlying 
genetic characteristics of the transmissibility and drug resis- 
tance of this cluster, specifically found in MDR-TB patients, we 
performed whole-genome sequencing of one representative 
strain, OM-V02_005. 

We sequenced OM-V02_005 genomic DNA on a 454 GS-FLX 
Titanium Sequencer (Roche) and assembled the reads using New- 
bler version 2.3. A total of 510,688 reads was generated, with an 
average read length of 439 bp, yielding a total sequence of 
224,340,568 bp. 

The assembled sequences contained 134 contigs, and the 
length of all contigs combined was 4,32 1 ,797 bp, with a G + C ratio 
of 65.5%. The average coverage depth was 52 X , the JV50 contig size 
was 68,368 bp, the average contig was 32,252 bp long, and the 
longest contig was 184,273 bp. 

Genome annotation was performed using GeneMark.hmm for 
prokaryotes (http://exon.gatech.edu/gmhmm2_prok.cgi), xBASE 
(http://www.xbase.ac.uk/annotation/) for coding sequence genes, 
and tRNAscan-SE 1.21 (http://lowelab.ucsc.edu/tRNAscan-SE/) 
andRNAmmer 1.2 (http://www.cbs.dtu.dk/services/RNAmmer/) 
for RNA genes. OM-V02_005 has 3,888 coding-sequence genes, 3 
rRNAs (in a single operon), and 52 tRNA genes. 

We compared the OM-V02_005 sequence with that of other 
representative M. tuberculosis strains: H37Rv (GenBank accession 
no. AL123456.2), CDC1551 (AE000516), Ell (CP000717), 
CCDC5079 (CP001641), and CCDC5080 (CP001642). M. tuber- 



culosis CCDC5079 is a drug-sensitive strain and CCDC5080 is a 
multidrug-resistant Beijing strain sequenced in China (2). The 
reciprocal best-hit BLAST approach revealed that OM-V02_005 
shares 90.0%, 92.7%, 90.0%, 92.7%, and 88.7% of its coding se- 
quences with M. tuberculosis m7Rv, CDC1551, Fll, CCDC5079, 
and CCDC5080, respectively. 

OM-V02_005 has 89 specific genes. Most of these were classi- 
fied as genes for hypothetical proteins, but genes for both a dehy- 
drogenase enzyme and the Tuf-like elongation factor Tu showed 
specific sequence alignment between OM-V02_005 and a clinical 
strain of M. tuberculosis T46 isolated in San Francisco from a 
Filipino patient. 

We investigated single nucleotide polymorphisms (SNPs) us- 
ing SNPs Finder (http://snpsfinder.lanl.gov/). OM-V02_005 has 
270 specific SNPs, of which 169 were nonsynonymous mutations, 
66 were synonymous mutations, and 26 were mutations in pro- 
moter regions. In addition to our previous report of the mutations 
in rrs, rpoB, katG, gyrA, embB, andpncA ( 1 ), mutations were found 
in rrl as I236T and in gidB as A413E, respectively. No significant 
mutations were found in inhA, oxyS, iniB, iniA, iniC, furA, and 
rpsL (Rv0682). Nonsynonymous mutations include regulatory 
proteins for growth/dormancy regulation (pknA, mprA, and 
rpfA), intermediary metabolism and regulation {coaA, ribH, tal, 
and bfrB), virulence genes {clpB and mce3F), lipid metabolism 
(ppsB and ppsD), and cell wall proteins (IprK/mcelE, mmpLl, 
mmpL2, and dppA). The OM-V02_005 sequence data will pro- 
vide an invaluable resource to elucidate the underlying mech- 
anisms of transmissibility and drug resistance acquisition of 
M. tuberculosis. 

Nucleotide sequence accession numbers. The whole genome 
sequence of OM-V02_005 has been deposited in DDBJ/EMBL/ 
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GenBank under the accession numbers BARZO 1000001 through 
BARZO 1000 121. 
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